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ABSTRACT 



A document imaging system detects skew and/or size of a 
document. In one embodiment, a document imaging system 
generates scanning signals representing the documents, ana- 
lyxes the scanning signals to detect one more edges of the 
document before the entire lenght of the document is 
scanned, establishes a skew angle between the detected 
edges and a reference orientation, and modifies the scanning 
signals to compensate for skew while the document is being 
scanned. In another embodiment, a document imaging sys- 
tem detects one or more edges of a document, defines a 
polygon having sides substantially congruent with the 
detected edges, and establishes the size of the document in 
response to the polygon. 

20 Claims, 9 Drawing Sheets 
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METHOD AND APPARATUS FOR NEAR 
REAL-TIME DOCUMENT SKEW 
COMPENSATION 

CROSS-REFERENCE TO RELATED 
APPLICATION 

This application is a continuation of U.S. patent applica- 
tion Ser. No. 08/170,125 filed Dec. 20, 1993, now U.S. Pat. 
No. 5,818,976 which is incorporated by reference in its 
entirety and which is a continuation-in-part of U.S. patent 
application Ser. No. 08/143,218 filed Oct. 25, 1993, now 
abandoned. 

TECHNICAL FIELD 

The present invention relates in general to optical scan- 
ning and image processing, and relates more particularly to 
a document imaging system which detects skew and/or 
size/shape of a document image, and for compensation of 
skew. 

BACKGROUND ART 

Document imaging technology is used in a growing 
number of business and other applications such as facsimile 
or "fax" machine transmissions, optical character recogni- 
tion (OCR), the digitizing of photographs and artwork, and 
photocopy machines. In many of these applications, best 
results are usually obtained if document contents are prop- 
erly aligned with an optical sensor used to generate an image 
of the document and if the size and/or shape of the document 
can be established. 

If a document page is misaligned with respect to the 
optical sensor, the resultant image is similarly skewed. 
Because the contents of a document page are usually aligned 
with the page itself, a skewed page usually results in a 
misalignment with the optical sensor. Misalignment can 
reduce the amount of data compression achievable by fax 
machines and can increase the error rate of OCR processes. 
As a result, the ease with which such systems may be used 
is impaired because operators must take care to ensure that 
the medium is reasonably well aligned with the optical 
sensor. 

Skew compensation in optical systems such as those 
disclosed in U.S. Pat. Nos. 5,027,227 and 5,093,653 are 
unsatisfactory because they require operator input to estab- 
lish the "skew angle" or the amount of image rotation 
required to compensate for skew. A system disclosed in U.S. 
Pat. No. 4,953,230 does not require operator input, but it 
relies upon the existence of text or other marks on the page 
to establish the orientation of the page. Other disadvantages 
of these systems include requiring large amounts of memory 
to store the image while the skew angle is established and 
imposing a considerable delay after scanning before skew is 
compensated. Furthermore, the skew compensation tech- 
niques disclosed in these patents severely distort the image 
unless the skew angle is small. 

In many applications, document images are either trans- 
mitted immediately or stored for later use. Transmission- 
channel bandwidth and storage capacity are required to 
convey portions of the scanned image outside the edges of 
the scanned document unless the size/shape of the document 
can be established. This bandwidth or storage capacity is 
essentially wasted because this portion of the image does not 
convey useful information about the contents of the docu- 
ment. It is, therefore, desirable for a document imaging 
system to establish the size/shape of pages in a document so 
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that the required bandwidth required to transmit, or the 
storage capacity required to store, a document image is 
minimized. 

A method and a device are needed for automatic skew 
5 compensation and for automatic size and/or shape detection. 

DISCLOSURE OF INVENTION 

It is an object of the present invention to provide auto- 
matic skew detection in a document imaging system. 

It is an object of the present invention to provide auto- 
matic skew compensation in a document imaging system. 

It is an object of the present invention to provide skew 
compensation as a document is scanned, thereby reducing 
15 the amount of memory required to store scanning data while 
skew is compensated and reducing the delay before portions 
of the deskewed document image can be processed. This is 
referred to herein as "near real-time" skew compensation. 
It is yet another object of the present invention to provide 
20 for a document imaging system which establishes the size 
and/or shape of a scanned document. 

These and other objects which are discussed throughout 
this description are achieved by the invention as claimed. 
In accordance with the teachings of one aspect of the 
25 present invention, one embodiment of a document imaging 
system generates scanning signals in response to optical 
characteristics of a medium such as a sheet of paper against 
a background with contrasting optical characteristics. 
Orientation, size and/or shape of the medium are established 
30 in response to transitions in the scanning signals correspond- 
ing to transitions between contrasting optical characteristics 
of the medium and the background. 

In accordance with the teachings of another aspect of the 
present invention, one embodiment of a document imaging 
35 system generates scanning signals in response to optical 
characteristics of a medium, records points in response to 
transitions in the scanning signal corresponding to an edge 
of the medium, fits a line to the points, establishes a skew 
angle between the fitted line and a reference orientation, and 
40 compensates for skew by modifying the scanning signals as 
required such that the resultant skew angle is substantially 
equal to zero. 

In another embodiment, a document imaging system 
4S establishes a measure of confidence in the line fit by calcu- 
lating the error between the line and the recorded points, and 
compensates for skew only if the measure of confidence is 
sufficiently high. 

In yet another embodiment, a document imaging system 
50 fits more than one line to points corresponding to more than 
one edge of the medium, establishes a measure of confidence 
in the fit of each fine, selects the line with the highest 
measure of confidence, establishes a skew angle between 
that line and a corresponding reference orientation, and 
55 compensates for the established skew angle. 

In another embodiment, a document imaging system 
ensures that the recorded points span a minimum distance 
before attempting to detect an edge by fitting a line to the 
points. 

60 In accordance with the teachings of yet another aspect of 
the present invention, one embodiment of a document imag- 
ing system detects one or more points in an image corre- 
sponding to one or more edges of a document, establishes a 
closed contour in response to the one or more points, defines 

65 a polygon having sides tangential to the closed contour, and 
establishes the size/shape of the document in response to the 
polygon. 
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In another embodiment, a document imaging system invention is illustrated in FIG. 1. Optical sensor 2 generates 
detects the points in a low-pass filtered representation of the scanning signals representing an image of a medium such as 
image. In yet another embodiment, a document imaging a document page. References to "page" herein should be 
system defines a polygon which is a rectangle. understood to refer generally to a suitable medium carrying 
Features of the various embodiments mentioned above 5 information which is printed, written, drawn ^or oth lerwise 
u j-cjj/ u--i*p *u u,r conveyed optically. Skew compensator 4 analyzes the scan- 
may be modified ao^or combined to form other embodi- s f gna i^ detects and calculates a skew angle between an 
mem* Aspects ot tbe present tnvenuon and its preferred ^ * ^ ^ Kknn(x orienUdoD( and modifics 
embodunents may be better understood by referring to the F si | nals M ^ ltd ^ (hal me resu]tant skew 
foltowmg discussion and the accompanying drawings in substantially equal to zero. Size/shape detector 6 
which like reference numerals refer to like elements id the 1U , iL • • t j *■ * u j 
*»uitu i^. 6 ™ u . , . . M1 . analyzes the resulting scanning signals and estimates bound- 
several figures. The embodiments set forth in tbe following . fU . J!™™ a A„^ t u* ~™ tk 0 
- j .ij- jjl r anes in the image corresponding to edges ot the page. Ine 
discussion and in the drawings are provided by way of t . f 4l _ . f t .. * . ° t , . , . 

, , . , ^ j . J , portion of the image within the estimated boundaries is 

examples only and should not be understood to represent \ . . . t 0 

: r . ' _ . . r stored m image storage o. 

limitations upon the scope of the present invention. _ t t 

r r r 15 . Other embodunents of a document imaging system are 

BRIEF DESCRIPTION OF DRAWINGS possible. For example, an embodiment comprising optical 

sensor 2, skew compensator 4 and image storage 8 provides 

FIG. 1 is a functional block diagram of a document for skew detection and compensation. An embodiment com- 

im aging system incorporating various aspects of the present prising optical sensor 2, size/shape detector 6 and image 

invention. 20 storage 8 provides size/shape detection. Various embodi- 

FIG. 2 is a schematic representation of peripheral devices ments of skew compensator 4 and size/shape detector 6 are 

which may used in a computing system which implements discussed in more detail below. Image storage 8 is not 

a document imaging system. required to practice any aspect of the present invention but 

FIG. 3a is a flowchart representing one method of skew represents a component which uses the image representation 

compensation according to various aspects of the present 25 Passed either by skew compensation 4 or by size/shape 

invention detector 6. As an alternative to image storage 8, for example, 

FIG. ib is a flowchart representing one method of edge s^/shape Rector 6 may pass an image representation to a 

detection and skew angle calculation according to various component transmission or for display, 

aspects of the present invention. °P tical 2 ma y implemented in a number of 

_ _ . . „ . ^ .. ... r 30 ways using a variety of sensing elements. No particular 

FIG. 4a is a flowchart representing another method of c fc critic ^ t0 the cXic& of the reS ent mveD . 

skew compensation according to various aspects of the tion; a prefgrrcd embodiment ^ a ^ 

present invention. sensor (CIS) and a mechanism capable of moving the page 

FIG. 46 is a flowchart representing another method of rc i at ivc to the CIS. This implementation is generally pre- 

edge detection according to various aspects of the present 35 f crrec j because of its relatively low cost, small size and 

invention. ruggedness. Alternative embodiments include flat-bed scan- 

F1G. Sis a hypothetical graphical representation of a page ners in which a mechanism moves the sensing element 

image. relative to the page and hand-held scanners in which the 

FIGS. 6-8 are hypothetical representations of magnified sensing element is moved by hand relative to the page. In 

views of image pixels in the vicinity of a page corner. 40 principle other types of sensing elements can be used such 

FIGS. 9a-9d are schematic representations of page ^ for example, a linear charge-coupled device (CCD) a 

images having various skew angles. two-dimensional CCD array, or an image orthicon tube. If a 

T^c-iAn L *t_.-i u-i * c two-dimensional sensing arrangement is used, the page 

FIGS. 10-11 axe hypothetical graphical < representations of should held M "* ^ ^ tQ * e 

apage unage enclosed within a polygonal approximation of ^ ^^ element . Altcrnati ; e , y>aflyill g S p 0tscann i ngsvstem 

e page e ges. can ^ em pi 0 y ec i The present invention contemplates any 

MODES FOR CARRYING OUT THE means for generating a signal which corresponds to a 

INVENTION two-dimensional image representation of the page. Either 

transmissive or reflective optical techniques may be used. 
FIG. 5 is a hypothetical graphical representation of a 50 W hen reflective techniques are used, preferred embodi- 
document image as it might be generated by a document ments incorporate a scanning background with optical char- 
imaging system without skew and size/shape detection and acteristics which contrast with the optical characteristics of 
skew compensation. Tbe image within frame 100 comprises me pages l0 ^ scanned by the optical sensor. For example, 
page image 102, which is skewed with respect to the frame, jf me pag^ are conventional white sheets of paper, the 
and area 104 outside the edges of the page image. Document 5S background may be black or a shade of grey. Grey shading 
imaging systems incorporating various aspects of the present fr om approximately, thirty percent to one hundred percent 
invention directed toward skew detection and compensation (black) usually provide sufficient contrast with white sheets 
attempt to generate an image in which page image 102 is 0 f papcr jf wn j tc ^ black pages are to be scanned, the 
oriented with respect to frame 100. Document imaging background may be an intermediate shade of grey such as 
systems incorporating various aspects of the present inven- 60 fifty percent. If the optical sensor is sensitive to color, the 
tion directed toward size/shape detection attempt to generate background may be a contrasting color. The background 
an image in which frame 100 is substantially coincident with may ^ comprise a gradation in shading or color, or a 
the edges of page image 102. pattern of lines, dots, or other shapes of one or more shades 
. and/or one or more colors which contrast with optical 
asic rue ure g5 cnar acteristics of the pages; the edges of a page can be 
The basic structure of one embodiment of a document established by detecting transitions in the scanning signal 
imaging system incorporating various aspects of the present corresponding to transitions between the background and the 
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page. If the contrast between portions of an edge and the reduced. Jn a preferred embodiment, scanning data corn- 
background is not sufficient to allow detection of transitions prises binary pixels indicating whether corresponding por- 
in the scanning signal, then those portions can be established tions of the scanning signal represent "white" or "black" 
by interpolating and/or extrapolating information from tran- images. A pixel value equal to one corresponds to a black 
sitions which could be detected. 5 and a P^ 1 value ec l ual t0 negative one corresponds 
w a t . , c i*u 4*-ui to a white image. These values are convenient choices for 
In flat-bed scanners, for example, the contrastmg back- 4 . »• i i &u a u a 
, « . , i-j A. _r i j the particular non-recursive low-pass falters described 
ground may be provided by a lid or other surface placed over ^ Qther ^ Qf bin yalues ^ ^ 0Qe and zerQ 

the page to be scanned, or the background may be provided be ^ m Qther embodiments . 

by the absence of a lid or other surface in proximity to the Iq alternative embodiments, scanning data comprises pix- 
scanned page. In feed-through or roUer type scanners me 10 clswilhd ^ tolv ^ rep ^ ntipgdSOTteshadesof p eyor 
contrasting background may be provided by the roller. With color ^ prescnt invcntion may be ^cd t0 dctect aad 
hand-held scanners, the contrastmg background may be compensate skew in such grey-scale images; however, edge 
provided by a mat or other surface behind the page being detection, discussed below, is simplified by using binary- 
scanned. Many other implementations are possible. valued pixels. This may be accomplished by comparing the 
Specifications for the optical sensing element are influ- 15 digital values to a threshold to establish binary elements 
enced by a variety of design choices including desired image indicating whether the pixel corresponds to a "light** or 
resolution and size of the page to be scanned. If a linear "dark" image. For ease of discussion, the following descrip- 
sensing element is used, the relative velocity between the uon assumes the scanning data comprises binary pixels, 
scanning element and the page is another consideration. In LIMIT 26 establishes a position in the scanning data for 
one preferred embodiment, optical sensor 2 comprises a CIS 20 a given page after which skew compensation will not be 
with 1728 elements scanning along a line 8.5 inches in attempted if a skew angle has not yet been established. This 
length, thereby providing a horizontal resolution of approxi- feature minimizes the amount of memory required to tem- 
mately 200 picture elements or "pixels" per inch. The CIS is porarily store scanning data, and also passes image data to 
scanned at a rate of 400 Hz and the page is moved forward a subsequent process in near real-time even if skew com- 
approximately 0.005 inches between scans, thereby provid- 25 pensation is not possible. 

ing a vertical resolution of approximately 200 samples per Scanning data is held in a buffer while an attempt is made 

inch. The image resolution and scanning speed are mere to calculate the skew angle. If the skew angle is calculated 

design choices and are not critical to the practice of the successfully, the scanning data in the buffer and scanning 

present invention. ^ data received subsequently is modified to compensate for 

FIG. 2 provides a schematic representation of keyboard skew, and the modified data is passed to a subsequent 

10, scanner 12 and display 14 used in one embodiment of a process. In the embodiment shown in FIG. 1, for example, 

microprocessor-based computing system which implements the subsequent process is size/shape detector 6. If the skew 

a document imaging system. Scanner 12 represents a device angle cannot be calculated successfully, the data in the buffer 

which incorporates optical sensor 2, and may also incorpo- 3S and the scanning data received subsequently for this page 

rate skew compensator 4 and/or size/shape detector 6. Func- are passed unaltered to a subsequent process. LIMIT 26 

tions not performed by scanner 12 may be performed in a establishes the position in the scan of a given page at which 

host computer, not shown, attached to the peripherals illus- skew angle calculations are abandoned and the buffered data 

trated in FIG. 2. Although it is contemplated that much of the begins to pass unaltered. 

processing described below is implemented by software, ^ Various criteria which may be used to establish this 

some or all of the processing could also be implemented by position include, for example, a maximum number of 

hardware. The present invention may be implemented by a recorded points provided to edge detection, a physical 

wide variety of hardware and software combinations. dimension across the page, or maximum size of the buffer 

available to hold scanning data. In one embodiment using 

Skew Detection and Compensation ^ mc optical sensor described above, the limit is set at sixty- 

The flowchart in FIG. 3a illustrates steps in one method four scanning lines, 

of skew detection and compensation according to various EDGES 28 analyzes scanning data to detect one or more 

aspects of the present invention. Many variations are pos- edges of the page and attempts to calculate the angle 

sible. Some of these variations are discussed below. An between one of the edges and a reference orientation. Any 

alternative method is discussed after the first method is 50 reference orientation may be used with the present 

described. invention, but common choices are the principal axis of the 

INIT 22 ensures that various elements are prepared for the optical sensing element and a line orthogonal to that axis 

following steps. -For example, optical sensor 2 may be which is parallel to the plane of the page, 

placed into a known state to begin scanning a page. In Edge detection may look for one or more edges. For 

embodiments utilizing software, memory may be allocated ss example, edge detection may look only for the left edge of 

and data work areas are set to initial values. In particular, for the page, or it may look for the left, top and right edges, 

the embodiment discussed below, initialization includes Edge detection in one embodiment, described in more detail 

clearing a buffer of all scanning data and setting indications below, looks for two edges adjacent to a comer of the page 

that a skew angle has not yet been calculated, a scanning which are referred to as a "left" edge and a "right" edge; 

limit has riot yet been reached, the end of the page has not 60 however, one of these edges usually corresponds to the top 

yet been reached, scanning is not yet done, and no points edge of the page. The top edge cannot be identified until the 

corresponding to page edges have been recorded. skew angle is established. 

SCAN 24 provides scanning data in response to scanning BUFFER 30 temporarily stores scanning data in the 

signals received from optical sensor 2. Preferably, SCAN 24 buffer. The size of the buffer required to hold this data is 

receives scanning signals from an optical sensor in incre- 65 affected by the operation of LIMIT 26, described above, 

ments representing a single line of the image scan so that the SKEW 32 inquires whether a skew angle has been suc- 

a mount of memory needed to store the scanning data is cessfully calculated for this page. If it has, scanning data in 
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the buffer is processed by ROTATE 34. If it has not and skew clockwise direction. The upper right corner of the page is the 

compensation is abandoned because the limit set by LIMIT first part of the page which is scanned by optical sensor 2. 

26 has been exceeded, then scanning data is processed by The edge to the left of this comer is actually the top edge of 

PASS 36. Otherwise, ±e scanning data is retained in the the page, but it is referred to as the "left" edge during edge 

buffer and END 38 is performed next. After the skew angle 5 detection processing. If the page were skewed by a greater 

calculation is either successful or abandoned, image data amount, say fifty-five degrees counter-clockwise for 

may be passed to a subsequent process in near-real time by example, edge detection would identify the edge adjacent to 

either ROTATE 34 or PASS 36. the right margin of the page as the top edge. 

ROTATE 34 applies a rotational transform to scanning If the top edge of the page happens to be perfectly aligned 

data in the buffer so that an image corresponding to the 1° with the optical sensing element, the two detected edges are 

transformed data is rotated with respect to an image corre- not adjacent to one comer but instead are adjacent to 

sponding to the scanning data as received from SCAN 24. opposite comers of the top edge. This unlikely occurrence 

The magnitude of the rotation angle is substantially equal to does not pose any difficulty for the method described below, 

the skew angle calculated by EDGES 28 but is opposite in In one embodiment, edge detection analyzes a low-pass 

sign. As a result, the rotational transform compensates for 15 filtered representation of the scanning data to minimize the 

skew. The results of the transform are passed to a subsequent adverse effects of factors such as noise in the optical sensor, 

process such as size/shape detector 6. small chips of paper within the view of the sensor, and 

The center of rotation is not critical to the practice of the random marks on the page. The filter may be recursive or 

invention. In the preferred embodiment, the center is estab- non-recursive, analog or digital, implemented by a variety of 

lished at one of the upper comers of frame 100 (see FIG. 5). 20 techniques. No particular filter implementation is critical to 

This is described in more detail below. the practice of the invention; however, one embodiment 

PASS 36 takes scanning data from the buffer and passes comprises two 10-tap non-recursive ^filters. In one filter, used 

it to a subsequent process without applying a rotational to detect " left " the "4^"* ta P s h * ve a 

transform. PASS 36 is performed only when attempts to „ coefficient equal to one and the last five adjacent taps have 

calculate a skew angle have been abandoned. • coefficient equal to negative one. The second filter usedto 

i-vrr^o • u *u *u j *.u • a * u detect the "right" edge, is a mirror of the first filter. The 

END 38 inquires whether the end of the scanning data has , . . .. % u . a j u . c 

. , ir „ _ . . . ° 4 , . characteristics of the filter are influenced by a number or 

been reached. If not SCAN 24 ^performed again to obtain such ^ ^ of ^ ^ ^ ^ 

additional scanning data. technique used to fit a line to the recorded points. 

DONE 1 40 'inquires whether ^atain the buffer has been 30 ENTRY 52 receives one or more lines of scanning data as 

processed. If not either ROTATE 34 or PASS 36 is per- m for ed 6c{tc{io n. In preferred embodiments, one fine 

formed until the buffer is empty. Qf daU ^ receiyed 

TERM 42 performs after all image data corresponding to LEFT 54 analyzes the scanning data and establishes 

a page has been passed to a subsequent process. This may whclher m me data represents a "left" edge. In 

mclude turning off the optical sensor, releasing allocated ^ ^ cmbodiment> lhis fa accomplish e d by examining the 

memory and/or notifying a subsequent process that all image scamiflg data from left to right , looking for a transition from 

data has been passed. black {Q white If ^ a fraction is found, the horizontal 

Edge Detection and Skew Angle Calculation location of me transition is ^ m a ^ of recorded " Ieft " 

40 P° mls - T 

The flowchart in FIG. 3b provides additional detail of one pjj $6 establishes the best fit of a straight line to the set 

method to detect edges and calculate a skew angle. Many c f recorded "left" points and establishes a measure of 

variations are possible. Some of these variations are dis- confidence in the fit. In one embodiment, a method which 

cussed below. minimizes the mean squared error between the line and the 

la one embodiment, optical sensor 2 scans a page against 45 recorded points is used to fit the line, but any method of 

a black background. The black background provides a high fitting lines may be used. The mean squared error may also 

degree of contrast with most sheets of paper. A black be used as a measure of confidence in the fit. 

background is not required to practice the present invention, in practice, the location of some of the recorded "left" 

but a background with some optical characteristic that points will deviate significantly from the actual location of 

contrasts with the pages is preferred. 50 the "left" edge. These so-called "bad" points may be handled 

If a contrasting background is not used, other methods in any of several ways. In one embodiment, a given point is 
may be used to provide an indication of the edges. For excluded from the set of recorded "left" points if the distance 
example, a set of electromechanical switches or optical between the given point and the fitted line is greater than 1.2 
switches can be arranged in a manner to sense the location pixels. This distance is obtained from the length of an 
of one or more edges of each page in conjunction with the 55 orthogonal projection from the point to the line. As an 
scanning by the optical sensor. The position information can alternative, the bad points may be kept in the set of recorded 
be included in the scanning data by multiplexing it with the "left** points and the following inquiry is modified accord- 
signals generated by the sensing element. ingly. 

In one embodiment, edge processing analyzes scanning OK 58 inquires whether the measure of confidence in line 

data to record points which correspond to two edges of the 60 fit is sufficient to establish the location of the "left" edge. In 

page. In general, these edges are adjacent to a corner of the one embodiment, the measure of confidence is the number 

page image because the page is seldom aligned exactly with of points in the set of recorded "left** points after all bad 

the optical sensor. As mentioned above, in this embodiment points are excluded. In one embodiment, a minimum of 

the two edges are referred to as "left" and "right" edges even thirty recorded points is required to establish an edge, 

though one of these edges usually corresponds to the top 65 In theory only two recorded points are required to define 

edge of the page image. Referring to FIG. 5, for example, the a line but, because of various sources of uncertainty in actual 

page is skewed approximately 10 degrees in a counter- practice, more recorded points are required to define the line 
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confidently.* This uncertainty is illustrated in FIG. 6 which degrees, the angle 0 between second reference orientation 

provides an enlarged view of a portion of an image. Each dot 320 and "left" edge 322 is established as the skew angle. A 

such as dot 200 represents one pixel. The colored dots rotation of +30 degrees compensates for skew, and the "left" 

correspond to "black" pixels and the uncolored dots corre- edge becomes the actual left edge of the image, 

spond to "white" pixels. The number of pixels processed is 5 Referring to FIG. 9c, the angle 8 between first reference 

generally very large, but only a small number of pixels are orientation 330 and established "right" edge 332 is approxi- 

shown for convenience. The pixels shown in the figure were mately 30 degrees. This is the skew angle for this page 

generated by scanning a page having orthogonal edges image. Note that the image must be rotated by +30 degrees 

defined by lines 202 and 204. The same set of pixels could to compensate for skew, and that the "right" edge becomes 

also be generated from a scan of a page having orthogonal 10 the top edge of the image. 

edges defined by lines 203 and 205. The large difference Referring to FIG. 9d, the angle between the first reference 

between the two sets of lines represents the high degree of oricnUtion not shown , aod « righr ^ge 342 fc a pp roxi _ 

uncertainty in establishing an edge using only a few mate]y sixty degreeg Because tfais ^ ^ lhan 

recorded points. forty-five degrees, the angle 0 between second reference 

FIG. 7 provides a similar view to that shown in FIG. 6. 15 or j entat i oa 340 and "right" edge 342 is established as the 

Addition scanning lines provide additional recorded points s ^ cw Arotation of -30 degrees compensates for skew, 

for each edge, thereby reducing the uncertainty in the and ^ « right « cdgc becomes the actual right edge of the 

location of these edges. Line 210 represents a reference image 

orientation^ The angle between reference orientation 210 Iq ^ embodiment, only one edge is used to 

and edge 202 corresponds to the amount of skew in the 20 ^ ^ ^ ^ ^ fa subsequemly 

1115 age * rotated in a manner that aligns that edge with one of two 

In an alternative embodiment in which bad points are kept re f erenC e orientations. This feature provides two advan- 

in the set of recorded "left" points, the required measure of teges> first advantage is illustrated in FIG. 8 which 

confidence should be relaxed. provides an enlarged view of a portion of an image. Line 202 

If a sufficient measure of confidence is achieved, SKEW represents a ragged edge. Although attempts to fit a straight 

66 is performed next; otherwise, RIGHT 60 is performed uoe to this edge will very likely fail, it is likely that a straight 

next. The order of processing for the two edges is not ii Qe can ^ fi tted to the pixels marking edge 204. After the 

critical. In an alternative embodiment, analysis of the scan- angle between the "right" edge and the second reference 

ning data for the "right" edge may be performed prior to or ^ orientation is established, skew will be compensated cor- 

coincident with analysis for the "left" edge. rectly by rotating the image in a manner similar to that 

RIGHT 60 analyzes scanning data and establishes described above for the example shown in FIG. 9cL 

whether any transition in the data represents a "right" edge. second advantage arises from the fact that no assump- 

In one embodiment, this is accomplished by examining the t£ on need be made about me between the two edges 

scanning data from right to left, looking for a transition from 35 processed by edge detection. In fact, if the top edge of a 

black to white. If such a transition is found, the horizontal rectangular page is aligned with the sensing element, the two 

location of the transition is saved in a set of recorded "right" edg es will be parallel and skew compensation will still 

P omls - function correctly. In this unusual case, the angle between 

y 7 FIT 62 establishes the best fit of a straight line to the set either the "left" edge or the "right" edge and the second 

I of recorded "right" points and establishes a measure of ^ reference orientation will be zero. No skew compensation 

J confidence in the fit. Refer to the description of FIT 56 above be applied. 

Lfor additional details. LIMIT 68 sets an indication that the limit position estab- 

OK 64 inquires whether the measure of confidence m line by UMIT 26 (see FIG. 3a) has been reached. This 

fit is sufficient to establish the location of the "right" edge. effectively prevents EDGE 28 from performing again during 

Refer to the description of OK 58 above for additional 4S me SCSLU 0 f ^ pa g e 

cS SUfi T Dt < l oonfidenoe * achieved ' EXIT 70 terminates this iteration of edge detection and 

iKtW 00 is pertormed next; otnerwise, processing pro- skew 

angle calculation and returns to allow BUFFER 30 

Ce6 ^™,« ! , , , u ...a. , (see FIG 3a) to perform next. 

SKEW 66 calculates the skew angle by establishing the 

\ angle between the detected edge and a reference orientation. 50 Image Rotation 
* Ih preferred embodiments, two mutually orthogonal refer- 
ence orientations are used so that the skew angle is never An image may be rotated by applying a rotational trans- 
more than forty-five degrees. The first reference orientation form to cach P* cl - ^ transformation may be expressed by 
is parallel to the principal axis of the linear CIS array in rotation equations 
- optical sensor 2 The second reference orientation is 55 ^-^xos e.^ (1) 
orthogonal to the first reference orientation and parallel to 

the plane of the page image. y^-ix^x^ sin Q+(y x -y 0 yoos e+y 0 (2) 

Referring to FIG. 9a, the angle 8 between first reference . 

orientation 310 and "left" edge 312 is approximately 30 where (Wo)=cartesian coordinates of the center of rotation, 

degrees. This is the skew angle for this page image. Note that 60 (Xj,y ^cartesian coordinates of a given pixel before 

the image must be rotated by -30 degrees to compensate for rotation, 

skew, and that the "left" edge becomes the top edge of the (x^yj^cartesian coordinates of a given pixel after 

image. The negative angle denotes rotation in a clockwise rotation, and 

direction. 8=skew compensation angle. 

Referring to FIG. 9/>, the angle between the first reference 65 The rotation may be accomplished by moving each pixel 

orientation, not shown, and "left" edge 322 is approximately by appropriate horizontal and vertical displacements, but 

sixty degrees. Because this angle is greater than forty-five this method requires considerable computational resources. 
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The horizontal displacement Ax=(x 2 -x 1 ) and the vertical As scanning data is passed from the buffer to a subsequent 

displacement Ay=(y 2 -y 1 ) for each pixel may be obtained process, ROTATE 34 delays each column j by an amount 

from a pair of displacement equations derived from the corresponding to the respective vertical displacement Ay,, 

rotation equations as follows: effectively applying the second shear, and applies the third 

5 shear to the resultant row k by displacing it horizontally by 

Ax*(*i-V(«w e-iHy^-sin e (3) an amount equal to Ax*, 

Ay-fo-a^-sin 6-Cv^VqHcos 6-1). (4) This implementation is effective only if the vertical dis- 

, , placement for all pixels is always less than or equal to zero. 

In principle the center of rotation may be established at M menlioned above> the chosen ^nto of rotation ensures 

any point but the choice of center can have practical lmph- 1Q tnat ^ 

cations in various embodiments. In the preferred This choice of center introduces two other aspects! One 

embodiment, the center of rotation is established at either the ^ mat ^ ^ a rolated { ^ 

upper right corner of frame 100 (see FIG. 5) for a counter- olitfm|e from oulside me image frame. Referring to FIG. 5, 

clockwise rotation if the skew compensation ang e is for example> phantom pixels to the right of tD6 right ^g,. 

positive, or the upper left corner of frame 100 for a clock- 1S and abow the , ed . of framc m ^ be rotated ^ the 

wise rotation if the skew compensation angle is negative final ; , n the preferred embo diment, these pixels will 

This choice ensures that the vertical displacement for all be ^ cn characteristics matching thos6 of ^ 

pixels in the image is always less than or equal to zero. An background . In an alternative embodiment, these pixels may 

advantage of this result is discussed below. ^ jven characteristics matcb ing those of neighboring pix- 

A number of methods are known in the art which require 2Q ^ 

fewer processing resources than are required to apply either ^ uami t jg tnat , rtion of ^ 102 

the rotauon or the displacement equations above. A method ^ rotated ^ d [he ^ of frame lfl0 Referring , 0 mG _ 

disclosed in U.S Pat No. 4,953,230, cited above, shafts 9a> fof e le> the center of rotation Ues on Une 

vertical columns of pixels within the unage representation to 3W ^ a • t vertically ^ me upper left of 

correct for skew. Unfortunately, this method will severely M ^ • e , n ^ embodiment describe above> this 

distort the image unless the skew angle is very small, A jnt ^ coillcide5 ^ the uppcr left of the 

method disclosed in U.S. Pat No. 5,027,227 cited above, frame skew compensation ^ rotatc a portion o£ this pag6 

divides the image into areas of pixels and shifts the areas in ^ ^ d me , efl ^ of , he j &ame ^ ^ ^ 

parallel, both horizontally and vertically to correct for skew. avoide<J b ^ additional memory to extend the bound- 

This method will also severely distort the image unless the J0 aries of the frame( or by ensuring ^ me optica , 

skew angle is small. _ scanner that the medium does not get too close to the limits 

Apreferred rotational technique is a "three-shear" method of ^ ical Mgsi elements, 
discussed more fully in a paper by A. W. Paeth entitled "A Alternatives to the preferred rotational technique and 

Fast Algorithm for General Raster Rotation, published I in ^pi^ent^on may ^ ^ w y bmt departing from the 
Graphics Gems, A. S Glassner, ed., 1990, pp. 179-195, JS J of te > mvention 
which is incorporated herein by reference in its entirety. In 

concept, the first shear moves all pixels in a respective row Alternative Skew Compensation Method 

i horizontally by a distance equal to Ax,-. After the pixels in 

all rows are moved, the second shear moves all pixels in a The flowcharts in FIGS. 4a-46 illustrate steps in a method 

respective column j vertically by a distance equal to Ay,-. ^ which is an alternative to that shown in FIGS. 3a-36 and 

After the pixels in all columns are moved, the third shear discussed above. Steps in this alternative method which 

moves all pixels in a respective row k horizontally by a correspond to steps in the first method discussed above have 

distance equal to Ax^ The horizontal and vertical displace- like reference numbers in the respective figures. A more 

meats for the three shears are calculated from the expres- detailed discussion of these corresponding steps is provided 

sions above and is not repeated here. 

45 

INIT 22 is substantially similar to that discussed above. 
cots - 1 9 (5) This step prepares various elements for the following steps. 

A* = 0-1 ->-o)— * -Cvi -yo)- ^ j n p ar ticular, for the embodiment discussed below, this 

includes clearing a buffer of all scanning data and setting 
50 indications that a skew angle has not yet been calculated, a 
4y-(*i-*c)*in (*) scanning limit has not yet been reached, the end of the page 

where the approximations are valid for small values of 9 P°} been reached > no " p ^ ket " haS * een 

expressed in radians. formed, and no points in proximity to page edges have been 

recorded. 



In the preferred embodiment, ROTATE 34 (see FIG. 3d) 



does not rotate the entire image at once. Scanning data 55 SCAN 24 is substantially similar to that discussed above; 

received before the skew angle is established is stored in the Preferably, SCAN 24 receives scanning signals, from an 

buffer without modification. When the skew angle is optical sensor in increments no larger than a "packet," 

established, ROTATE 34 calculates the appropriate vertical discussed below, so that the amount of memory needed to 

displacement Ay, for each column j in the image, and store the scanning data is reduced. In a preferred 

calculates the appropriate horizontal displacement Ax t - for 60 embodiment, each increment of scanning data comprises a 

each line or row i of scanning data already stored in the smgle Une of binary pixels indicating whether corresponding 

buffer. ROTATE 34 applies the first shear to each of the rows portions of the scanning signal represent "white" or "black" 

already stored. Thereafter, ROTATE 34 calculates the hori- images. 

zontal displacement Ax, for each line or row i of scanning LIMIT 26 establishes a position in the scanning data for 

data as it is stored in the buffer and applies the first shear to 65 a given page after which skew compensation will not be 

that row by displacing it horizontally by an amount equal to performed if no page edge has been detected. A first 

Ax f . "packet" of scanning data is assembled into a buffer while an 
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attempt is made to detect one or more edges of a page image. cussed above, the method illustrated in FIG. 4b processes 

If at least one edge is detected, the first packet in the buffer points for one edge at a time and is unaware of "left" and 

and all subsequent packets for this page are modified to "right" edges. In one software embodiment, the logic of FIG. 

compensate for skew. If an edge cannot be detected, the first *b is implemented as a subroutine which is invoked once for 

packet in the buffer and all subsequent packets for this page 5 eac ° P° int corresponding to either a "left" or a "right" edge, 

are passed unaltered to a subsequent process. LIMIT 26 ^ calUn g routine obtains P oiDls in response to transitions 

establishes the position in the scan of a given page at which detected m the scanning data and controls the flow of 

edge detection is abandoned and the packets begin to pass processing for each respective edge, 

unaltered. In one embodiment using the optical sensor MIN 53 inquires whether the number of points recorded 

described above, the limit is set at sixty-four scanning lines. 10 for f he K r T espec r f/ eq " aIs or exceeds u a 

. number N^. If fewer than this minimum number of points 

EDGES 28 analyzes scanning data to detect one or more has ^en recorded, it is unlikely that the edge can be 

edges of the page. In one embodiment, scanning data is established accurately. EXIT 70 is performed next if the 

analyzed to detect transitions corresponding to "left" and number of recorded points is less than N^. In one 

"right" edges. As described above, one of these edges embodiment, is equal to five, 

generally corresponds to the top edge of the page image. EDGE 55 inquires whether the distance between the first 

Transitions corresponding to each edge may be processed and lasl recorded points for the respective edge equals or 

using the method described below and illustrated in FIG. 4b. exceeds a threshold distance D MJN . If the first and last 

BUFFER 30 temporarily stores scanning data in the recorded points span a distance less than this threshold 

buffer. The size of the buffer required to hold this data is amount, it is unlikely that the edge can be established 

affected by the operation of LIMIT 26 described above and 20 accurately. EXIT 70 is performed next if the distance is less 

PACKET 31 described below. than D^. In one embodiment using the optical sensor 

PACKET 31 inquires whether enough scanning data has described above, V MW is forty-eight pixels which is sub- 
been assembled into the buffer to form a "packet." In one . stantially equal to one-quarter inch, 
embodiment, a packet constitutes sixty-four scanning lines. ^ In one embodiment, the distance d k between the first 
If insufficient scanning data has been assembled to form a recorded point (x^Vj) and the kth recorded point (X^Y*) is 
packet, SCAN 24 is performed again to obtain additional established from the expression 
scanning data. When enough scanning data has been 

assembled to form a packet, CALC 33 is performed next if i . — . (7) 

it is the first packet for this page; otherwise, SKEW 32 is 3Q * = V + ( ykY 
performed next. 

CALC 33 is similar to SKEW 66 discussed above. This where Ax^Xj-xJ, 

step calculates the skew angle for this page if at least one Ay^jyj-yJ, and 

edge has been detected; otherwise the skew angle is not k>i. 

calculated. If more than one edge has been detected, the 35 \ a another embodiment, the distance d^ is estimated from 

angle is calculated using the edge which is most nearly the expression 
horizontal. Use of this edge is preferred because it is 

generally established with greater accuracy. In other d^MAX^*, Ay*) (8) 

embodiments, the edge corresponding to the line fit with the where j )=the iarger of . me two arguments i and j. 

smallest error is chosen for the angle calculation. 40 The csiim ^ obtained from expression 8 may be preferred 

In preferred embodiments, the skew angle is reset to zero to the more accurate result obtained from expression 7 in 

if the calculated angle exceeds approximately twenty so me embodiments because fewer processing resources are 

degrees because it is unlikely that the actual skew angle will required. In either embodiment, EDGE 55 obtains distance 

exceed twenty degrees; therefore such a large angle is more ^ and establishes whether d* equals or exceeds distance 

likely the result of error rather than the result of excessive 45 D MW . If not, EXIT 70 is performed next; otherwise, FIT 56 

skew. is performed next. 

SKEW 32 inquires whether a skew angle has been sue- In yet another embodiment, the functions of MIN 53 and 

cessfully calculated for this page. If it has, the packet in the EDGE 55 may be combined into a single step which ensures 

buffer is processed by ROTATE 34; otherwise, the packet is that a requisite number of points have been recorded before 

processed by PASS 36. 50 FIT 56 is performed. This may be accomplished in a number 

ROTATE 34 applies a rotational transform to packets of of ways. One way is to calculate the required number of 

scanning data in the buffer. Rotational transforms are dis- recorded points n from the expression 
cussed in more detail above. The results of the transform are 

passed to a subsequent process such as size/shape detector 6. _ MIN fC m ^ + A D N | < 9 > 

PASS 36 passes packets of scanning data to a subsequent 55 11 A* J ' * J 

process without applying a rotational transform. 

END 38 inquires whether the end of the scanning data has where m =0.6875 

been reached. If not, SCAN 24 is performed again to obtain ^.o ^2 anc j 

additional scanning data. 6Q MIN(i, j)=the smaller of the two arguments i and j. 

TERM 42 is substantially similar to that discussed above. According to this expression using the indicated coefficients, 

The flowchart in FIG. 4b provides additional detail of a j$ MW (5) recorded points are required for Ay^/Ax^O, and 

method to detect an edge of a page image. Some variations (43) recorded points are required for Ay^/Ax*- 1.303; 

in this method are also discussed below. ^max recorded points are required for skew angles greater 

ENTRY 52 records one point obtained in response to a 65 than approximately 52.5 degrees. 

line of scanning data which is believed to correspond to a Neither the expressions nor the coefficient values are 

page edge. Unlike the method illustrated in FIG. 3b dis- critical to the practice of the present invention. These 
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expressions are shown to illustrate several ways in which a Referring to FIG. 5, for example, page image 102 repre- 

minimum number of recorded points can be established for sents a page which is skewed counterclockwise. The scanner 

those embodiments which require a minimum number of should not be shut off until the lower left-hand comer of the 

recorded points before attempting to fit a line. page has been scanned. A switch located near the right-hand 

FIT 56 is similar to that discussed above. This step fits a 5 edge of the page will sense the trailing edge before the end 

straight line to a set of recorded points associated with the of the page passes; therefore, the switch will prematurely 

respective edge. indicate the end of the page and possibly cause the scanner 

FOUND 61 inquires whether the respective edge has been to shut off too soon, 

declared "found." If it has been found, ALL 57 is performed The end of a page may be detected more accurately by 

next; otherwise, OLD 59 is performed next. 1Q using additional switches to sense the trailing edge at 

ALL 57 establishes a measure of error for each recorded various points across the width of the page, but each 

point relative to the line fitted by FIT 56. In one additional switch increases the cost of implementing the 

embodiment, the measure of error for each recorded point is imaging system. An imaging system incorporating various 

the distance between the respective point and the fitted line. aspects of the present invention can accurately detect the end 

This distance is obtained from the length of an orthogonal of a page without using additional switches, 

projection from each respective point to the line. Other 1S [ 0 one embodiment, one switch is used to sense both 

measures of error including those discussed above may be leading and trailing edges. Sensing the leading edge initiates 

used in other embodiments. scanning. Sensing the trailing edge initiates a process which 

OLD 59 establishes a measure of error for "old" recorded detects the end of the page. This process analyzes scanning 

points relative to the line fitted by FIT 56. "Old" recorded data and establishes whether any transition in the data 

points are all recorded points excluding the latest point 20 represents an edge of the page. If no transition can be found, 

recorded by ENTRY 52. This exclusion allows the latest the process may conclude that the end of the page has passed 

recorded point to influence the line fit until such time as the the optical sensor. Alternatively, the process may require that 

respective edge is deemed to be "found" by preventing no transitions be found in some minimum number of scan- 

DISCARD 67, discussed below, from discarding it from the ning lines before an end of page is indicated, 

set. 25 The detection of leading edge and end of page may be 

OK 58 inquires whether the largest error measure estab- combined with automatically initiated document image pro- 

lished by either ALL 57 or OLD 59 is less than a margin of cessing as described in U.S. patent application Ser. No. 

error E^. If the largest error measure is less than the 07/922,169 filed Jul. 29, 1992, incorporated herein by ref- 

margin, FOUND 63 is performed next; otherwise, MAX 65 erence in its entirety. Such a combination can provide for an 

is performed next. In one embodiment, the margin of error 30 eflBcient and convenient way to initiate and terminate docu- 

is 1.2 pixels. ment ima g e processing. 

FOUND 63 sets an indication that the respective edge has Size/Shape Detection 

been found. This indication is tested by FOUND 61, dis- „ ¥ ^, . , , . , ... . „ 

j . FIG. 10 is a hypothetical graphical representation of a 
cussed above 

w*v *c • ' • u *u *u . r * j j *c page image. The image within frame 100 comprises page 

MAX 65 inqmres whether the number of points recorded 35 f B 4 * , + * , , r ^ • 

f .u — • , . * u xt image 102 and area 104 outside the edges ot the page image, 

for the respective edge exceeds a maximum number N„ A:r . Y & . . . , . f. . . • 

lf . , - j j . . . t . . - * In many imaging systems, it is desirable to reduce the size 

If the number of recorded points in the set is less than or - . & J . ' . . _ 

equal to EXIT 70 is performed next; otherwise «*■ 104 bec fZ u D ° """^ lnfonnatlon f 

DISCARD 67* performed to keep the size of the set equal about ° f thc P a f but lt m " ciiSCS th f 6 " mount of 

to -JW By limiting thc size of the set, the amount of « space required to store the image. Aspects of the present 

• j » r j i i 4 invention directed toward size/shape detection attempt to 
processing required to fit lines and calculate error measures 

i l i • . j t t_ j* * xt • i . minimize this area. 

may also be limited. In one embodiment, N^^ is equal to . , „ rt <x 

forty eight ^ n P re f errec * embodiments, optical sensor 2 (see FIG. 1) 

DISCARD 67 identifies the recorded point which has the ^ a P a S e a black background. A black background 

largest measure of error and discards it from the set of 45 * not required, but a background that contrasts with page 

recorded points edges is preferred. If a contrasting background is not used, 

EXIT 70 terminates this iteration of edge detection. olher methods such as those described above for skew 

compensation may be used to provide an indication of the 

End of Page Detection edges. For ease of discussion, the following assumes that the 

Edge detection processing similar to that discussed above so image within frame 100 is formed by an optical sensor 

also allows a document imaging system to more accurately scanning a white page with a black background. Preferably, 

detect the end of a document page. This improvement can be a low-pass filtered representation of the image is used to 

important in imaging systems which attempt to turn off the detect edges. 

scanner immediately after the end of a page has been One embodiment of size/shape detection comprises the 

scanned. 55 steps of: (1) establishing points in proximity to the edges of 

Many feed-through or roller-type document imaging sys- the page image, (2) fitting lines through the points to 

tems such as fax machines use one or more switches to sense establish a closed contour which encloses the page image 

the leading edge of a page to turn on a scanner and to sense but does not cross any edge, and (3) defining a polygon with 

the trailing edge of the page to turn off the scanner. In one sides coincident with the lines in the contour. The polygon 

embodiment, a single switch near the point of page insertion 60 is considered to define the size and shape of the page image, 

senses both edges. The insertion of a page trips the switch In one implementation, step (2) defines a line along each 

and causes the scanner to be turned on. The switch also edge of frame 100 and examines the pixels along the fines, 

senses the passing of the trailing edge and causes the scanner Each line is moved progressively into the image away from 

to be turned off after a pre-established interval. the respective frame edge until it encounters an edge of page 

Unfortunately, the switch may incorrectly indicate the end of 65 image 102. When each line has encountered a page image 

page and cause the scanner to shut off too soon if the page edge, step (3) defines a polygon with sides coincident with 

is severely skewed or has an irregular shape. the four lines. 
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FIG. 10 illustrates the results of this process. Lines 502, 
504, 506 and 508 are moved away from their respective 
frame edge until encountering an edge of page image 102. 
Polygon 500, with sides coincident with the four lines, is 
considered to define the size and shape of the page image. 5 
The polygon can be substantially coincident with the edges 
of the page image only if each edge of frame 100 is parallel 
with a respective edge of the page image. 

There are a number of ways in which an edge can be 
detected. One way examines the pixels along a line and 1Q 
concludes that an edge has been encountered if the number 
of white pixels exceeds a threshold/A second way examines 
groups of pixels along a line and concludes that an edge has 
been encountered if any group-has a number of "white" 
pixels which exceeds a threshold. In one particular imple- 
mentation of this second way, the number of "white" pixels 15 
in each 8x8 group of sixty-four pixels just inside the line is 
compared against a threshold value of fourteen. In this 
context, it should be understood that a "white" pixel refers 
to a pixel which deviates significantly from the optical 
characteristics of the background. 20 

These two ways low-pass filter the image. Other ways 
may apply a variety of filters including two-dimensional 
non-recursive filters. One example is a sparse five-tap two 
dimensional filter with a center tap and four taps in each of 
the four cardinal directions. Although the choice of filter 25 
may have significant impact upon system performance, no 
particular filter or filtering method is critical to the practice 
of the present invention. In an embodiment of a document 
imaging system incorporating aspects of the present inven- 
tion directed to both skew compensation and size/shape 30 
detection, it is also possible to use the same low-pass filtered 
version of the image for both skew compensation and 
size/shape detection. 

In another implementation which requires considerably 
more processing resources, step (2) establishes a set of 35 
points for each edge. A respective line is fit to each set of 
points. Apoint is discarded from a set if the distance between 
the given point and the respective fitted line exceeds a first 
threshold. The discarded point is added to an adjacent set of 
points if the distance between that point and the line fitted to ^ 
the adjacent set does not exceed a second threshold. A set of 
points is considered adjacent to the discarded point if it 
meets three criteria: (1) it is not the set from which the point 
was just discarded, (2) it contains the closest of all points to 
the discarded point except for possibly those points in the set 45 
from which the point was just discarded, and (3) the distance 
between this closest point and the discarded point does not 
exceed a third threshold. If no adjacent set exists, the 
discarded point is excluded from further consideration. A 
discarded point will not be added to a given set more than 50 
twice. 

This process continues until a measure of confidence in 
the fit for each set of points is sufficiently high. FIG. 11 
illustrates the results of this process. Lines 502, 504, 506 and 
508 are fit to a respective set of points for each edge. 55 
Polygon 500, with sides coincident with the four lines, is 
considered to define the size and shape of the page image. 
Even though the skew of page image 102 has not been 
compensated, the polygon is essentially coincident with the 
edges of the page image. go 

We claim: 

1. A method comprising the acts of: 

optically scanning a medium to generate sea ruling signals, 

wherein said medium has a length, 
analyzing said scanning signals to detect one or more 65 

edges of said medium prior to scanning the entire 

length of said medium, 
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calculating a skew angle between one of said edges and a 
reference orientation prior to scanning the entire length 
of said medium, and 

modifying said scanning signals while scanning said 
medium to correct for skew. 

2. A method according to claim 1 that scans said medium 
in front of a background provided by a roller having a shade 
of grey within a range from one hundred percent to about 
thirty percent. 

3. A method according to claim 1 that modifies said 
scanning signals only if said skew angle is less than about 
twenty degrees. 

4. A method according to claim 1 that analyzes said 
scanning signals to obtain three or more points in proximity 
to each of said one or more edges, and establishes one or 
more fitted lines by fitting each of said lines to three or more 
of said points, wherein each of said fitted lines corresponds 
to a respective one of said one or more edges. 

5. A method according to claim 4 that establishes two or 
more fitted lines, establishes a confidence level of fit for each 
fitted line, selects the edge corresponding to the fitted line 
having a high confidence level of fit, and calculates said 
skew angle between said selected edge and said reference 
orientation. 

6. A method according to claim 5 that establishes a 
confidence level of fit for each of said fitted lines and 
modifies said scanning signals only if at least one confidence 
level of fit is sufficiently high. 

7. A method according to claim 1 that modifies said 
scanning signals according to a three-shear technique to 
effect a rotation, wherein a center of rotation is chosen such 
that vertical displacements required to effect said rotation 
are less than or equal to zero. 

8. A method according to claim 1 that modifies said 
scanning signals to effect a rotation about a center of 
rotation, wherein said center of rotation is chosen such that 
vertical displacements required to effect said rotation are 
less than or equal to zero. 

9. A document imaging system comprising: 

an optical sensor having an output terminal that conveys 
scanning signals representing a medium, wherein said 
medium has a length, 

a computing system coupled to said output terminal and 
having a processor, wherein said processor analyzes 
said scanning signals to detect one or more edges of 
said medium prior to said optical sensor scanning the 
entire length of said medium, calculates a skew angle 
between one of said edges and a reference orientation 
prior to scanning the entire length of said medium, and 
modifies said scanning signals while said optical sensor 
scans said medium to correct for skew. 

10. A document imaging system according to claim 9 
wherein said optical sensor scans said medium in front of a 
background provided by a roller having a shade of grey 
within a range from one hundred percent to about thirty 
percent. 

11. A document imaging system according to claim 9 
wherein said processor analyzes said scanning signals to 
obtain three or more points in proximity to each of said one 
or more edges, and establishes one or more fitted lines by 
fitting each of said lines to three or more of said points, 
wherein each of said fitted lines corresponds to a respective 
one of said one or more edges. 

12. A document imaging system according to claim U 
wherein said processor establishes two or more fitted lines, 
establishes a confidence level of fit for each fitted line, 
selects the edge corresponding to the fitted line having a high 
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confidence levei of fit, and calculates said skew angle 
between said selected edge and said reference orientation. 

13. A document imaging system according to claim 11 
where in said processor establishes a confidence level of fit 
for each of said fitted lines and modifies said scanning 5 
signals only if at least one confidence level of fit is suffi- 
ciently high. 

14. A document imaging system according to claim 9 
wherein said processor modifies said scanning signals 
according to a three-shear technique to effect a rotation, 10 
wherein said processor selects a center of rotation such that 
vertical displacements required to effect said rotation are 
less than or equal to zero. 

15. A document imaging system according to claim 9 
wherein said processor modifies said scanning signals to IS 
effect a rotation about a center of rotation, wherein said 
processor selects said center of rotation such that vertical 
displacements required to effect said rotation are less than or 
equal to zero. 

16. A method comprising: 20 
step for generating scanning signals representing aa 

image of a medium, wherein said medium has a length, 
step responsive to said scanning signals for detecting one 
or more edges of said medium prior to generating 
scanning signals for the entire length of said medium, 



step responsive to said scanning signals for determining 
skew between one of said edges and a reference ori- 
entation prior to generating scanning signals for the 
entire length of said medium, and 

step for correcting said skew while generating said scan- 
ning signals. 

17. A method according to claim 16 wherein said one or 
more edges are detected by fitting each of one or more lines 
to three or more points in proximity to each of said one or 
more edges, wherein each of said fitted lines corresponds to 
a respective one of said one or more edges. 

18. A method according to claim 17 that fits two or more 
fitted lines, selects the edge corresponding to the fitted line 
having a high confidence level of fit, and determines said 
skew for said selected edge. 

19. A method according to claim 18 that corrects said 
skew only if said confidence level of fit is sufficiently high. 

20. A method according to claim 16 that compensates for 
said skew by effecting a rotation about a center of rotation, 
wherein said center of rotation is chosen such that vertical 
displacements required to effect said rotation are less than or 
equal to zero. 
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